Rohnert Park
Can OpenAI o1 outperform humans in higher-order cognitive thinking?
Latif, Ehsan, Zhou, Yifan, Guo, Shuchen, Shi, Lehong, Gao, Yizhu, Nyaaba, Matthew, Bewerdorff, Arne, Yang, Xiantong, Zhai, Xiaoming
This study evaluates the performance of OpenAI's o1-preview model in higher-order cognitive domains, including critical thinking, systematic thinking, computational thinking, data literacy, creative thinking, logical reasoning, and scientific reasoning. Using established benchmarks, we compared the o1-preview models's performance to human participants from diverse educational levels. o1-preview achieved a mean score of 24.33 on the Ennis-Weir Critical Thinking Essay Test (EWCTET), surpassing undergraduate (13.8) and postgraduate (18.39) participants (z = 1.60 and 0.90, respectively). In systematic thinking, it scored 46.1, SD = 4.12 on the Lake Urmia Vignette, significantly outperforming the human mean (20.08, SD = 8.13, z = 3.20). For data literacy, o1-preview scored 8.60, SD = 0.70 on Merk et al.'s "Use Data" dimension, compared to the human post-test mean of 4.17, SD = 2.02 (z = 2.19). On creative thinking tasks, the model achieved originality scores of 2.98, SD = 0.73, higher than the human mean of 1.74 (z = 0.71). In logical reasoning (LogiQA), it outperformed humans with average 90%, SD = 10% accuracy versus 86%, SD = 6.5% (z = 0.62). For scientific reasoning, it achieved near-perfect performance (mean = 0.99, SD = 0.12) on the TOSLS,, exceeding the highest human scores of 0.85, SD = 0.13 (z = 1.78). While o1-preview excelled in structured tasks, it showed limitations in problem-solving and adaptive reasoning. These results demonstrate the potential of AI to complement education in structured assessments but highlight the need for ethical oversight and refinement for broader applications.
- North America > United States > Georgia > Clarke County > Athens (0.14)
- Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)
- Asia > China > Jiangsu Province > Nanjing (0.04)
- (5 more...)
- Research Report > New Finding (1.00)
- Research Report > Experimental Study (1.00)
- Questionnaire & Opinion Survey (1.00)
- Education > Educational Setting > Higher Education (1.00)
- Education > Curriculum > Subject-Specific Education (1.00)
- Health & Medicine (0.93)
- Education > Assessment & Standards (0.68)
- Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.72)
AI sports betting platforms' breaches likely impacting March Madness wagers
Fox News Flash top sports headlines are here. Check out what's clicking on Foxnews.com. College basketball fans from across the country spent the past couple of weeks keeping a close eye on the NCAA Division I men's and women's basketball tournaments. Millions of sports enthusiasts filled out and submitted brackets with hopes their particular games' predictions would become true. The annual basketball tournament seemingly always sparks a noticeable amount of excitement across the sports world, but it also attracts the casual fan and those who might not normally participate in sports gambling.
- Asia > China (0.06)
- North America > United States > Connecticut > Tolland County > Storrs (0.05)
- North America > United States > California > Sonoma County > Rohnert Park (0.05)
Visual Response to Emotional State of User Interaction
Marhamati, Nina, Creston, Sena Clara
This work proposes an interactive art installation "Mood spRing" designed to reflect the mood of the environment through interpretation of language and tone. Mood spRing consists of an AI program that controls an immersive 3D animation of the seasons. If the AI program perceives the language and tone of the users as pleasant, the animation progresses through idealized renditions of seasons. Otherwise, it slips into unpleasant weather and natural disasters of the season. To interpret the language and tone of the user interaction, hybrid state-of-the-art emotion detection methods are applied to the user audio and text inputs. The emotional states detected separately from tone and language are fused by a novel approach that aims at minimizing the possible model disparity across diverse demographic groups.
- North America > United States > California > Sonoma County > Rohnert Park (0.05)
- North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
ChatGPT: The End of Online Exam Integrity?
This study evaluated the ability of ChatGPT, a recently developed artificial intelligence (AI) agent, to perform high-level cognitive tasks and produce text that is indistinguishable from human-generated text. This capacity raises concerns about the potential use of ChatGPT as a tool for academic misconduct in online exams. The study found that ChatGPT is capable of exhibiting critical thinking skills and generating highly realistic text with minimal input, making it a potential threat to the integrity of online exams, particularly in tertiary education settings where such exams are becoming more prevalent. Returning to invigilated and oral exams could form part of the solution, while using advanced proctoring techniques and AI-text output detectors may be effective in addressing this issue, they are not likely to be foolproof solutions. Further research is needed to fully understand the implications of large language models like ChatGPT and to devise strategies for combating the risk of cheating using these tools. It is crucial for educators and institutions to be aware of the possibility of ChatGPT being used for cheating and to investigate measures to address it in order to maintain the fairness and validity of online exams for all students.
- Asia > India (0.05)
- Oceania > New Zealand > North Island > Auckland Region > Auckland (0.04)
- North America > United States > Ohio (0.04)
- North America > United States > California > Sonoma County > Rohnert Park (0.04)
- Education > Educational Setting > Online (1.00)
- Information Technology > Security & Privacy (0.93)
- Government > Regional Government > North America Government > United States Government (0.46)
Task scheduling system for UAV operations in indoor environment
Khosiawan, Yohanes, Park, Young Soo, Moon, Ilkyeong, Nilakantan, Janardhanan Mukund, Nielsen, Izabela
Application of UAV in indoor environment is emerging nowadays due to the advancements in technology. UAV brings more space-flexibility in an occupied or hardly-accessible indoor environment, e.g., shop floor of manufacturing industry, greenhouse, nuclear powerplant. UAV helps in creating an autonomous manufacturing system by executing tasks with less human intervention in time-efficient manner. Consequently, a scheduler is one essential component to be focused on; yet the number of reported studies on UAV scheduling has been minimal. This work proposes a methodology with a heuristic (based on Earliest Available Time algorithm) which assigns tasks to UAVs with an objective of minimizing the makespan. In addition, a quick response towards uncertain events and a quick creation of new high-quality feasible schedule are needed. Hence, the proposed heuristic is incorporated with Particle Swarm Optimization (PSO) algorithm to find a quick near optimal schedule. This proposed methodology is implemented into a scheduler and tested on a few scales of datasets generated based on a real flight demonstration. Performance evaluation of scheduler is discussed in detail and the best solution obtained from a selected set of parameters is reported.
- Europe > Denmark > North Jutland > Aalborg (0.04)
- Asia > South Korea > Seoul > Seoul (0.04)
- North America > United States > South Carolina > Horry County > Myrtle Beach (0.04)
- (2 more...)
- Aerospace & Defense > Aircraft (0.67)
- Transportation > Air (0.67)
- Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles > Drones (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (1.00)